Removal of interfering strokes in double-sided document images
نویسندگان
چکیده
This paper addresses a special problem with historical document images where handwritten characters from the reverse side appear as noise on the front side and even interfere with the front side characters. A novel method to extract clear textual images from interfering and overlapping areas of text is presented here. The proposed algorithm is interesting in that, with an observation that the edges of the sipping strokes from the reverse side are not as sharp as those on the front side, it adopts an edge detection approach to suppress unwanted background patterns. By further concentrating on the orientation of the strokes, other remaining long and strong noisy edges are removed by using an orientation filter and a size filter. The proposed method proves to perform well regardless of the intensity differences between the foreground writing and the interfering strokes. The segmentation results of real images are shown and evaluated.
منابع مشابه
Directional Wavelet Approach to Remove Document Image Interference
In this paper, we propose a directional wavelet approach to remove images of interfering strokes coming from the back of a historical handwritten document due to seeping of ink during long period of storage. Our previous work required mapping of both sides of the document in order to identify the interfering strokes to be eliminated. Perfect mapping, however, is difficult due to document skews,...
متن کاملA wavelet approach to double-sided document image pair processing
In this paper, we present a novel method for processing double-sided historic handwritten documents using wavelets. The method is specially designed to remove the interfering strokes from the reverse side due to ink sipping through pages after long periods of storage. The proposed method works by first matching both sides of a document page such that the interfering strokes are mapped with the ...
متن کاملMatching of Double-Sided Document Images to Remove Interference
The National Archives of Singapore keeps a large volume of historical handwritten documents. One common problem with the archives is that over the years, ink sipped through the pages of these documents such that characters on the reverse side become visible and interfere with the characters on the front side. This paper addresses this problem and develops a novel algorithm to extract clear text...
متن کاملSegmentation and Analysis of Double-Sided Handwritten Archival Documents
Historical handwritten documents are preserved in good condition in many national archives or libraries. One problem that many archivists are facing is the sipping of ink through the pages of certain double-sided handwritten documents after long periods of storage. This paper addresses this problem and develops a novel algorithm to extract clear textual images from interfering and overlapping a...
متن کاملCharacter Extraction from Interfering Background - Analysis of Double-Sided Handwritten Archival Documents
The sipping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem to human readers or OCR systems. This paper addresses this problem through the recovery of content on the front side of a page from the interfering image caused by the handwriting on the reverse side. First, by adapting the Gaussian stochastic model, the inter...
متن کامل